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I.  Introduction 


Consider  the  multi -input  bilinear  system 


A = [a  + i B u ] 
i=l  ^ ^ 


x(t  ) = X e R 
o o 

where  (A,B  ••B  ) are  nxn  constant  matrices  and  u = (u  ••u  ) is  a vector 
1 m 1 m 

of  control  inputs  with  each  component  u^(t)  assumed  to  be  a square  integrable 

function  on  every  finite  time  interval,  t $ t $ t, . A subclass  of  the 

o 1 

system  (1)  is  the  "commutative  bilinear  system"  in  which  each  pair  of  matrices 
in  the  set  (A,B  ••B  ) commute,  i.e.  AB.  = B.A  and  B.B.  = B.B,  for  all 
i,j.  Sussmann  [1]  has  shown  that  the  attainable  set  for  this  subclass  is 
closed  relative  to  "bang-bang"  control  functions.  Baras  and  Hampton  [2]  ex- 
tended this  result  to  a class  of  delayed  commutative  bilinear  systems. 

In  this  paper,  a minimum  energy  control  problem  for  commutative  bilinear 
systems  is  studied  both  with  and  without  endpoint  constraints  on  the  state. 

It  is  shown  that  a given  terminal  state  x(t^)  = x^^  is  reachable  at  some 
time  t^  > t^  if  and  only  if  it  is  constant  reachable,  i.e.  reachable  using 
a constant  input  function.  This  suggests  that  the  optimal  control  for  a 
fixed  terminal  minimum  energy  regulation  problem  is  simply  a constant  vector. 

In  addition  to  verifying  this  supposition,  the  uniqueness  of  solutions  is 
studied  for  the  regulator  problem  without  an  end  point  constraint  and  a class 
of  physical  dynamic  systems  is  delineated  which  possess  the  commutative 


property. 


-2- 


II.  A Commutative  Bilinear  Physical  System 

The  absolute  velocity  for  the  motion  of  a particle  in  a moving  co- 
ordinate system  is  given  by  the  vector  equation  (eg.  see  [3]) 

V = R + (P_)p  + u)^  X (2) 

where  R is  the  absolute  velocity  of  the  origin  of  the  moving  coordinate 

system,  p is  the  position  vector  of  the  particle  relative  to  the  moving 

origin,  u is  the  angular  rotation  rate  of  the  moving  coordinate  system 

and  (p ) is  the  velocity  of  the  particle  relative  to  an  observer  fixed  in 
— r 

the  moving  coordinate  system.  If  the  particle  is  associated  with  a target 
"T"  and  the  moving  origin  with  a pursuer  "P" , and  if  motion  is  confined  to 
the  (x,y)  plane,  then  the  kinematic  equations  for  this  motion  can  be 
derived  as: 


X = -v_  sin  3 + u y 
T p 


y = v„  cos  3 - u X - V 
T p p 


B = - Uj, 


where  (Up,u^)  are  the  angular  rates  of  the  pursuer  and  target  with  respect 
to  a nonrotating  reference  frame,  line  speeds  of  the  pur- 

suer and  target,  (x,y)  are  the  horizontal  and  vertical  distances  of  the 
target  relative  to  the  pursuer,  and  3 is  the  relative  angle  between  the 
headings  of  the  target  and  pursuer. 

The  kinematic  equations  (3)  remain  valid  when  all  quantities  (u^ .v^ ,u^,v^) 
vary  with  time.  These  equations  arise  in  a variety  of  pursuit-evasion  type 
problems  such  as  the  "two  car"  and  "homicidal  chauffer"  games  of  Isaacs  [4], 

In  deriving  (3)  from  (2)  a unit  triad  is  defined  at  P so  that 

p_  = x£  + yj_,  ^ = Vpj_  and  at  all  times. 


L 
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i 


aerial  combat  between  two  aircraft  [5]  and  a missile  intercept  problem  [6]. 


In  order  to  associate  a commutative  bilinear  control  system  with  (3)  we 
assume  that  the  pursuer  can  manipulate  both  its  speed  and  its  angular 

acceleration  u^,  and  that  vanishes  only  at  pcints  of  measure  zero 

over  the  time  interval  of  interest.  Then  there  is  no  loss  of  generality  in 
postulating  the  existence  of  a scalar  valued  function  Y(t)  such  that 

V (t)  = Y(t)u  (t).  (4) 

P P 

Defining  state  variables  = x,  = y and  = B for  (3),  and  introducing 
three  auxiliary  states  according  to  Xj^  = sin  x^  , = cos  x^  and  x^  = 1, 

(3)  in  combination  with  (4)  is  seen  to  be  equivalent  to  the  bilinear  system 


X = Ax  + Bxu 


x(t  ) = Col[x, (t  ),x„(t  ),x  (t  ),sin  x,(t  ),cos  x,(t  ),1] 

O XO^OvO  wO  %30 


(5-a) 


0 0 0 -v^  0 0 
0 0 0 0 v„  0 


0 1 0 0 0 0 

-1  0 0 0 0 -Y 


0 0 0 


0 u^  ,B  = 0 0 0 0 0 -1 


(5-b) 


0 0 0 0 u^  0 

0 0 0 -u^  0 0 

0 0 0 0 0 0 


0000-1  0 
0 0 0 1 0 0 

0 0 0 0 0 0 


in  which  u = u^  is  regarded  as  the  primary  control  variable.  It  can  be 
readily  verified  that  A and  B commute  for  all  (u^,v^,y),  i.e. 


0 0 0 0 v^  0 

0 0 0 v^  0 0 

AB=  DA  =000000 
0 0 0 u^  0 0 

0 0 0 0 u^  0 

0 0 0 0 0 0 
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and,  moreover,  this  commutative  property  is  independent  of  Y(t)  = Vp(t)/Up(t). 

An  important  problem  for  ( 5 ) is  the  determination  of  feasible  controls 
for  the  intercept  condition: 


x^(t^)  = ^^(t^)  = 0 for  some  t^  > t^. 


(6) 


given  the  initial  condition  x(t^)  = and  the  target  parameters  (u^,v^). 
If  some  leeway  is  allowed  for  the  intercept  angle  Y 

is  regarded  as  a parameter  subservient  to  the  terminal  constraint  (6),  then 
a solution  exists  for  this  problem  in  which  the  control  is  given  by  the  con- 
stant function 


= “t  ' 

1 o 


• * 'r 


(7) 


Integral  to  this  solution  is  the  fact  that  the  parameters  (6^,y)  are  easily 

obtained  as  solutions  to  a certain  transcendental  equation  after  which  t^ 

is  given  explicitly  as  a function  of  and  the  initial  conditions.  The 

details  of  this  solution  are  given  in  Section  IV  of  [7],  along  with  a closed 

form  least  squares  estimate  of  the  parameters  (u  ,v  ,x  (t  )),  as  part  of  a 

1 i w o 

proposed  feedback  control  law  for  a missile  intercept  system. 

The  question  arises  as  to  the  interpretation  of  the  control  (7)  which 

drives  the  commutative  bilinear  system  (5)  from  an  arbitrary  initial  state 

<to  the  terminal  state  (6)  in  some  time  interval  t ? t J t, . The  results 

o 1 

I 

of  the  next  section  will  demonstrate  that  this  simple  control,  which  is  rela- 
tively easy  to  compute  and  implement,  is  optimal  in  the  minimum  control  energy 
sense. 


Ill . A Minimum  T;nergy  Control  Problem 

The  following  definition  of  reachability  is  pertinent  since  we  are  deal- 


ing with  a nonlinear  control  system. 
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Definition:  The  state  vectors  (x^,x^)  constitute  a reachable  pair  for 
the  bilinear  system  (1)  if  there  exists  a square  integrable  control  signal 


u(t)  which  accomplishes  the  transition  from  x = x(t  ) to  x,  = x(t, ) 

O O X i 


over  some  finite  time  interval  t $ t s t, . Likewise,  the  pair  (x  ,x, ) 

o 1 > r o 1 

is  said  to  be  constant  reachable  if  it  is  reachable  for  a constant  control 
signal. 


In  the  case  of  a time  invariant  commutative  bilinear  system  in  which 


the  matrices  commute  pair-wise,  the  solution  to  (1)  corresponding 


to  any  square  integrable  control  signal  u(t)  on  ^ ^ ^ ^1  written 

as  (see  Sussmann  [1]) 


m 


x(t,)  = e ^1  o ^.(t  ,t  )x 
1 . , 1 1 o o 

1=1 


(8) 


At 


where  e and  <!>.(t,t^)  are  the  state  transition  matrices  for  the  nxn 


matrices  A and  B^u^Ct),  respectively.  The  following  result  is  then  easily 


obtained . 

Assertion  A pair  (x^,x^)  for  a time  invariant  commutative  bilinear  system 
is  reachable  if  and  only  if  it  is  constant  reachable. 

Proof:  The  sufficiency  part  of  the  proof  is  true  by  definition.  The  neces- 
sity part  follows  upon  defining  a constant  control  signal  according  to  the 
average  value 


1 P 
"c  ' vT  I 

1 O ■'  t 


(9) 


If  the  pair  (x  ,x, ) is  reachable  using  a variable  control  u(t)  on  t s t < t , 
o 1 o 1 

then  (8)  and  (9)  imply 


s 

i i 


1 i 

( i 


(10) 


A(t,-t  ) 
= e 1 o 


m 

n € 

i=l 


B.u 
1 c 


which  is  just  the  state  response  at  time  t^^  using  the  constant  control  (9). 
This  verifies  the  Assertion. 

The  above  result  can  be  slightly  generalized  to  a class  of  time  varying 
commutative  bilinear  systems  of  the  form 

m 

x(t)  = [A(t)  + y B.a. (t)u. (t)]x(t) 

11  1 

1=1 

where  the  matrices  (A(t),B  ..B  ) commute  pair-wise  for  all  t and  (x  ,x, ) 

i m o 1 

is  a reachable  pair  corresponding  to  a certain  control  u(t)  on  (t  ,t, ). 

o 1 

In  this  case  the  pair  (x^,x^)  is  reachable  using  the  constant  control 


1-1 


a.(t)dt 

1 


[ ^ a.(t)u.(t)dt, 
Jt  ^ i 
o 


assuming  that  the  given  scalar  functions  a^(t)  are  such  that  their  integrals 


do  not  vanish  over  the  time  interval  (t  ,t, ) . 

o 1 


A.  Control  V/ithout  Terminal  Constraints 

Consider  the  quadratic  cost  over  a fixed  time  interval 


Jj^(u)  = x' (tj^)Fx(t^)  + 


u'(t)Ru(t)dt 


(11) 


where  P and  R are  symmetric  nonnegative  definite  constant  weighting  matrices 
with  R > 0.  The  optimal  controls  for  (1)  which  minimize  (11)  must  satisfy 
the  following  canonical  equations  from  the  maximum  principle : 


X*' 


[A  + 


B,u*]x* 
1 1 


X*(t  ) = X 
o o 


P"  = -[A  + y BIuvlp" 

1=1 


p*(t^)  = -rx*(t^) 


P’BiP* 


Ix^«-b;p^:J 


Assuming  commutativity  between  pairs  of  matrices  in  the  set  (A,B  '‘B  ) it 

1 m 


can  be  verified  directly  that  ~ (t)  = 0 for  all  te(t  ,t,).  FuttinE 

at  o 1 

t = t^  in  u*(t),  utilizing  the  boundary  condition  for  p*(tj^)  and  the  relation 


(10),  it  follows  that  the  constant  vector  u’'‘,  if  it  exists,  must  satisfy  the 


nonlinear  transcendental  eauation 


I . A'(t  -t  ) n “i^i^^i 
x'e  1 o , , e 


Bluv(t,-t  ) ^ B.uv(t,-t  ) A.(t,-t_)  1 


B,’F  ,1,  e 


1 1 1 o 1 o 


1 i=l 


1 D-1 

u-  = - ^ R 


\ n 3 uV(t,-t  ) „ B.uv(t  -t  ) A(t  -t  ) 

,A'(t-t)fI  11  1 On,-  ^ 11  1 o 1 o 

x'e  1 o . , e B'F  . , e e x 

o 1=1  m 1=1  c 


A -G(u*). 


The  existence  of  optimal  controls  can  be  established  independently  of 


the  commutativity  condition  through  the  study  of  the  properties  of  the  attain- 


able set  for  (1)  and  (21)  corresponding  to  all  souare  integrable  input  functions. 


The  details  of  this  study  can  be  found  in  Chapter  3 of  fS],  Therefore,  any  con- 


clusions based  on  the  necessary  conditions  of  the  maximum  principle  are  valid. 


Tlicse  results  are  summarized  in  the  following  theorem  together  with  a suffi- 


I 


cient  condition  for  uninueness. 


Theorem  1:  The  optimal  controls  for  a time  invariant  commutative  bilinear 

system  with  the  quadratic  cost  (11)  are  constant  vectors  u*  satisfying 

the  transcendental  equation  (12).  The  vector  u*  is  unique  if  the  followinp 

matrix  valued  function  Z(v  ) = (Z..)  is  nonnecative  definite  for  all  v cP.": 

o i:  o 


vTB'.BlF  + b:FB.]v 
o j 1 1 j o 


= l,2**m. 


For  a single  input  commutative  bilinear  system  (m  = 1),  unicueness  is 
guaranteed  for  the  simpler  nonnegativity  condition; 

Ej^^F  + Bj^FB^  > 0. 

Proof  Since  the  characterization  of  U"  by  (12)  has  already  been  established, 

\te  consider  the  uniqueness  of  solutions  to  U"  +G(u*)  = 0,  where  the  operator 
G is  defined  by  the  right  side  of  (12).  From  the  theory  of  monotone  operators 
(see  Minty  [9])  it  is  knov.-n  that  the  equation  u + G(u)  = 0 lias  a unique  solu- 
tion if  G is  a monotonically  increasing  map.  If  G is  Frechet  differentiable, 
a sufficient  condition  for  this  is  that  the  Frechet  derivative  operator  G’(u) 
bo  nonnegative  definite.  By  direct  computation  the  Frechet  differential  of 
G at  U"  with  increment  h is : 

dG(u*;h)  = i R"^Z(x  )*h,  h e R*" 

^ o 


where 


A'(t,-t  ) m B'ug(t,-t  ) 
Z,.  = x'e  ^ ° n e ^ ° 


'ug(t,-t  ) m B U"(t,-t  ) A(t,-t  ) 

^ ° [B-.B.F  t BIFD.]  n e ^ ° e ^ ° x 

3 ^ k=l 


By  hypothesis,  the  matrix  defined  by  (13)  Iv  nonnegative  definite,  which 

implies  that  the  matrix  7.(x^)  above  is  also  nonnegative  definite.  In  turn, 
this  assures  the  monotonicity  of  G (see  Vainberg  [10]). 
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If  the  more  general  quadratic  cost 


J(u)  = x'  (tj^)Fx(t^)  + 


^ Cx'(t)Qx(t)  + u' (t)Ru(t)]dt 


t 

o 

is  prescribed  in  order  to  weight  the  state  trajectory  x(t)  through  the 
symmetric  nonnegative  definite  matrix  Q,  then  the  optimal  controls  satisfy 


m 

X*  = [A  + B,u*]x* 

i=l  ^ ^ 


x*(t  ) = X 
o o 


m 

p*  = -[A'  + I B!u*]p*  + 2Qx*  P*(t  ) = -Fx*(t  ) 

i=l  ^ ^ 


1 


u*  = =■  R 


fx^'Bj^p* 


x*'B  p- 


In  this  case,  the  control  derivative,  ^ , becomes  (again,  using  commutativity) 


'x*’(t)QBj^x*(t)' 

x*'(t)QB  x*(t) 
m ' 


^ 0. 


Although  this  control  represents  a simple  pure  integration  on  quadratic  opera- 
tions involving  the  state,  it  is  considerably  more  difficult  to  compute  since 
the  boundary  condition  is  not  characterized  by  an  explicitly  defined  algebraic 
equation  as  in  (12). 


B.  Control  With  Terminal  Constraints 


We  now  consider  the  minimum  control  energy  problem  for  a commutative 
bilinear  system  subject  to  a fixed  terminal  state  x(t^)  = x^. 

Theorem  2:  Given  a time  invariant  commutative  bilinear  system  and  a reachable 
pair  (x^,x^),  the  optimal  controls  for  the  cost  function 

A 

J^Cu)  = u'(t)Ru(t)dt  , R > 0 (14) 

^o 


are  constant  vectors  u*  satisfying  the  transcendental  equation 


A(t,-t  ) m B.U''(t.-t  ) 

1 o „ 1 1 1 o 

e He  X 

i=l 


(15) 


Proof:  From  the  Assertion  it  is  known  that  if  the  pair  (x^,Xj^)  is  reachable 

using  some  control  u(t),  t ^ t ^ t, , then  it  is  reachable  using  the  con- 

o 1 

stant  control  given  by  (9).  Therefore,  comparing  the  costs  associated  with 
these  two  control  functions 

U,  = u'Ru  dt  = u'Ru  (t,-t  ) 

1 J c c c c 1 o 

t 

o 

A 

Uj  = u' (t)Ru(t)dt, 
t 

o 

it  follows  from  Holder's  inequality  applied  to  relation  (9)  that 
with  equality  if  and  only  if  u(t)  = a constant  vector  on  (t^,t^).  Equa- 
tion (15)  is  simply  (10)  rewritten.  This  proves  the  theorem. 
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Brockett  [11]  obtained  a (non-constant)  solution  to  the  minimum  control 
energy  problem  with  a fixed  terminal  state  in  the  case  of  nxn  matrix  state 
commutative  bilinear  systems  with  det  X / 0.  By  contrast,  the  solution  here 
to  the  vector  state  case  has  been  shown  to  be  simply  a constant  control  vector. 

The  following  simple  example  illustrates  the  nonuniqueness  of  optimal 
controls  for  a minimum  energy  problem  with  a fixed  terminal  constraint. 

Example ; Given  the  commutative  bilinear  second  order  system 

x^  = Xj^  + ux^  , x^(0)  = 0 

x^  = x^  - ux^  , x^CO)  = 1 

together  with  the  cost  to  be  minimized  and  the  terminal  constraint: 

fl  , X (1)  = 0 

J(u)  = I u (t)dt  , 

•'o  x^Cl)  = -e. 

It  can  be  seen  that  (A,Bj^)  commute  and  that  (x(0),x(l))  is  a reachable 
pair.  Hence,  by  Theorem  2 the  optimal  controls  satisfy  (15)  applied  to  this 
example : 

0 = e sin  u* 

-e  = e cos  u*. 

Solving  for  u*  we  obtain  u*  = kir,  k = ±1,±3«»,  and  in  order  that  J 
be  minimized  it  follows  that  u*  = it  and  u”  = -ir  are  both  optimal. 

IV.  Concluding  Remarks 

Although  commutative  bilinear  systems  form  a very  small  subset  of  bilinear 
systems,  there  nevertheless  exists  an  important  class  of  physical  systems  for 
which  the  condition  of  commutativity  is  upheld.  The  minimum  energy  controls 
for  this  class  relative  to  the  costs  (11)  and  (1*4)  have  particularly  simple 
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forms  as  constant  functions  determined  by  the  boundary  conditions.  Although 
these  controls  are  often  nonunique,  any  lack  of  uniqueness  can  possibly  be 
exploited  in  other  ways.  For  example,  the  intercept  angle  6 = x_(t,)  in 

the  missile  intercept  problem  discussed  briefly  in  Section  II  can  be  traded 
off  against  the  scaling  parameter  y in  obtaining  physically  meaningful  solu- 
tions for  the  pursuer  speed  and  acceleration  (Vp,Up),  as  shown  by  the  simu- 
lations carried  out  in  [7]. 

In  terms  of  general  bilinear  systems , where  A = AB  - BA  0 , the  rela- 


tively simple  solution  for  the  commutative  case  might  be  used  as  a nominal  or 
generating  solution  for  obtaining  minimum  energy  controls  in  a power  series  ex 
pension  of  the  commutator  A.  These  considerations  remain  to  be  explored. 
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